<110> APPLICANT: Lee, Se Yong et al.

<120> TITLE OF INVENTION: Trehalose Synthase Protein, Gene, Plasmids, Microorganisms,
      and A Process for Producing Trehalose

<130> FILE REFERENCE: 1599-0206P

<140> CURRENT APPLICATION NUMBER: US 09/937,215
<141> CURRENT FILING DATE: 2001-09-24

<160> NUMBER OF SEQ ID NOS: 6

<170> SOFTWARE: PatentIn version 3.1

<210> SEQ ID NO: 1
<211> LENGTH: 4753
<212> TYPE: DNA
<213> ORGANISM: Pseudomonas stutzeri

<220> FEATURE:
<221> NAME/KEY: CDS
<222> LOCATION: (1537)..(3603)
<223> OTHER INFORMATION:

<400> SEQUENCE: 1

gatcgctggc gtactgcagg tagagcaggc gcatcggccc ccagggcgca tcggccggct 60

ccgctgtgcc ctgctggttc atgaagcgga cgaagcggcc atcgcggaac cgtggacgcc 120

attcggggct gtccgggtcg cggctgtcgg tgagcgtgcg ccacaggtcg ctgcgaaacg 180

gcggaccgct ccaaagcgcg ccgtggatgg gatcgccgag cagttcgtgc agctcccagg 240

aacgttgcga atgcagcgcg ccgaggctca ggccatgcag atacaggcgc ggtcggcgtt 300

cggccggcag ttcggtccag tagccataga tctcggcgaa tagcgcgcgg gccacgtcgc 360

ggccgtagtc ggcctccacc agcagcgcca gcgggctgtt cagataggag tactgcaacg 420 

ccacgctggc gatatcgccg tggtgcaggt attccactgc gttcatcgcc gccgggtcga 480 

tccagccggt accggtgggc gtcaccagca ccagcaccga tcgctcgaag gcgccgctgc 540 

gctgcagctc gcgcaaggcc agacgcgccc gctggcgcgg ggtctctgcc gcgcgcagac 600 

cgacgtagac gcgaatcggc tcgagcgccg agcggccgct caagacgctg atatccgccg 660 

ccgacgggcc ggagccgatg aactcgcggc cggtgcggcc cagctcctcc cagcgcagca 720 

acgaggcccg gctgccgctt ttcagcggcg aggccggtgg cgccgtctcc ggttcgatca 780 

gggcgtcgta ctgcgcgaag gatgcgtcca gcatgcgcag tgcccgcgcc gccagcacat 840 

cgctgagcag cgaccagaac agcgccagcg ccaccagcac gccgatcacg ttggccaggc 900 

gccgtggcag cacgcggtcg gcgtgccgcg agacgaagcg cgacaccagc cgatacagac 960

gcgccagcgt cagcaggatg agaaaggtcg ccagcgcggt gagaatgact tcgagcaggt 1020

gcgcactgct caccggcggc atgcccatca gcgcgcgtac cgcgttctgc cagccggcga 1080 

cctggctgag gaaatacccg gccagcagca ggcagccgac cgcgatcagc agattgaccc 1140 

gctcgcgctg ccagcctggg cgctccggca gttccagata gcgccacagc cagcgccaga 1200 

acacgccgag gccatagccc accgccagcg ccgcgccggc cagcacgccc tggctcagcg 1260 

tcgagcgcgg cagcagcgat ggcgtcagcg ccgcgcagaa gaacagcgtg cccagcagca 1320

ggccgaaacc ggacagcgag cgccagatat agaggacggg caggtgcagc atgaagatct 1380

ccgcggtcgg gtgacggcgt cgcgcctcgg catatcgagg cgtgtccggt cgtgcggttc 1440

ccgtgatggt ccgcagcagg ccaatccgat gcaacgatgg ccgagcggcc gactcaaacg 1500

tctacatttc cctagtgctg ccggaaccga tcgccg atg agc atc cca gac aac 1554
                                        Met Ser Ile Pro Asp Asn
                                        1               5

acc tat atc gaa tgg ctg gtc agc cag tcc atg ctg cat gcg gcc cgc 1602
Thr Tyr Ile Glu Trp Leu Val Ser Gln Ser Met Leu His Ala Ala Arg
            10                  15                  20

gag cgg tcg cgt cat tac gcc ggc cag gcg cgt ctc tgg cag cgg cct 1650
Glu Arg Ser Arg His Tyr Ala Gly Gln Ala Arg Leu Trp Gln Arg Pro
        25                  30                  35

tat gcc cag gcc cgc ccg cgc gat gcc agc gcc atc gcc tcg gtg tgg 1698
Tyr Ala Gln Ala Arg Pro Arg Asp Ala Ser Ala Ile Ala Ser Val Trp
    40                  45                  50

ttc acc gcc tat ccg gcg gcc atc atc acg ccg gaa ggc ggc acg gta 1746
Phe Thr Ala Tyr Pro Ala Ala Ile Ile Thr Pro Glu Gly Gly Thr Val
55                  60                  65                  70

ctc gag gcc ctc ggc gac gac cgc ctc tgg agt gcg ctc tcc gaa ctc 1794
Leu Glu Ala Leu Gly Asp Asp Arg Leu Trp Ser Ala Leu Ser Glu Leu
                75                  80                  85

ggc gtg cag ggc atc cac aac ggg ccg atg aag cgt tcc ggt ggc ctg 1842
Gly Val Gln Gly Ile His Asn Gly Pro Met Lys Arg Ser Gly Gly Leu
            90                  95                  100

cgc gga cgc gag ttc acc ccg acc atc gac ggc aac ttc gac cgc atc 1890
Arg Gly Arg Glu Phe Thr Pro Thr Ile Asp Gly Asn Phe Asp Arg Ile
        105                 110                 115

agc ttc gat atc gac ccg agc ctg ggg acc gag gag cag atg ctg cag 1938
Ser Phe Asp Ile Asp Pro Ser Leu Gly Thr Glu Glu Gln Met Leu Gln
    120                 125                 130

ctc agc cgg gtg gcc gcg gcg cac aac gcc atc gtc atc gac gac atc 1986
Leu Ser Arg Val Ala Ala Ala His Asn Ala Ile Val Ile Asp Asp Ile
135                 140                 145                 150

gtg ccg gca cac acc ggc aag ggt gcc gac ttc cgc ctc gcg gaa atg 2034
Val Pro Ala His Thr Gly Lys Gly Ala Asp Phe Arg Leu Ala Glu Met
                155                 160                 165

gcc tat ggc gac tac ccc ggg ctg tac cac atg gtg gaa atc cgc gag 2082
Ala Tyr Gly Asp Tyr Pro Gly Leu Tyr His Met Val Glu Ile Arg Glu
            170                 175                 180

gag gac tgg gag ctg ctg ccc gag gtg ccg gcc ggg cgt gat tcg gtc 2130
Glu Asp Trp Glu Leu Leu Pro Glu Val Pro Ala Gly Arg Asp Ser Val
        185                 190                 195

aac ctg ctg ccg ccg gtg gtc gac cgg ctc aag gaa aag cac tac atc 2178
Asn Leu Leu Pro Pro Val Val Asp Arg Leu Lys Glu Lys His Tyr Ile
    200                 205                 210

gtc ggc cag ctg cag cgg gtg atc ttc ttc gag ccg ggc atc aag gac 2226
Val Gly Gln Leu Gln Arg Val Ile Phe Phe Glu Pro Gly Ile Lys Asp
215                 220                 225                 230

acc gac tgg agc gtc acc ggc gag gtc acc ggg gtc gac ggc aag gtg 2274
Thr Asp Trp Ser Val Thr Gly Glu Val Thr Gly Val Asp Gly Lys Val
                235                 240                 245

cgt cgc tgg gtc tat ctg cac tac ttc aag gag ggc cag ccg tcg ctg 2322
Arg Arg Trp Val Tyr Leu His Tyr Phe Lys Glu Gly Gln Pro Ser Leu
            250                 255                 260








aac tgg ctc gac ccg acc ttc gcc gcg cag cag ctg atc atc ggc gat 2370
Asn Trp Leu Asp Pro Thr Phe Ala Ala Gln Gln Leu Ile Ile Gly Asp
        265                 270                 275










gcg ctg cac gcc atc gac gtc acc ggc gcc cgg gtg ctg cgc ctg gac 2418
Ala Leu His Ala Ile Asp Val Thr Gly Ala Arg Val Leu Arg Leu Asp
    280                 285                 290












gcc aac ggc ttc ctc ggc gtg gaa cgg cgc gcc gag ggc acg gcc tgg 2466
Ala Asn Gly Phe Leu Gly Val Glu Arg Arg Ala Glu Gly Thr Ala Trp
295                 300                 305                 310




tcg gag ggc cac ccg ctg tcc gtc acc ggc aac cag ctg ctc gcc ggg 2514
Ser Glu Gly His Pro Leu Ser Val Thr Gly Asn Gln Leu Leu Ala Gly
                315                 320                 325






gcg atc cgc aag gcc ggc ggc ttc agc ttc cag gag ctg aac ctg acc 2562
Ala Ile Arg Lys Ala Gly Gly Phe Ser Phe Gln Glu Leu Asn Leu Thr
            330                 335                 340








atc gat gac atc gcc gcc atg tcc cac ggc ggg gcc gat ctg tcc tac 2610
Ile Asp Asp Ile Ala Ala Met Ser His Gly Gly Ala Asp Leu Ser Tyr
        345                 350                 355






gac ttc atc acc cgc ccg gcc tat cac cat gcg ttg ctc acc ggc gat 2658
Asp Phe Ile Thr Arg Pro Ala Tyr His His Ala Leu Leu Thr Gly Asp
    360                 365                 370












acc gaa ttc ctg cgc atg atg ctg cgc gaa gtg cac gcc ttc ggc atc 2706
Thr Glu Phe Leu Arg Met Met Leu Arg Glu Val His Ala Phe Gly Ile
375                 380                 385                 390




gac ccg gcg tca ctg atc cat gcg ctg cag aac cat gac gag ttc acc 2754
Asp Pro Ala Ser Leu Ile His Ala Leu Gln Asn His Asp Glu Phe Thr
                395                 400                 405






ctg gag ctg gtg cac ttc tgg acg ctg cac gcc tac gac cat tac cac 2802
Leu Glu Leu Val His Phe Trp Thr Leu His Ala Tyr Asp His Tyr His
            410                 415                 420








tac aag ggc cag acc ctg ccc ggc ggc cac ctg cgc gaa cat atc cgc 2850
Tyr Lys Gly Gln Thr Leu Pro Gly Gly His Leu Arg Glu His Ile Arg
        425                 430                 435










gag gaa atg tac gag cgg ctg acc ggc gaa cac gcg ccg tac aac ctc 2898
Glu Glu Met Tyr Glu Arg Leu Thr Gly Glu His Ala Pro Tyr Asn Leu
    440                 445                 450












aag ttc gtc acc aac ggg gtg tcc tgc acc acc gcc agc gtg atc gcc 2946
Lys Phe Val Thr Asn Gly Val Ser Cys Thr Thr Ala Ser Val Ile Ala
455                 460                 465                 470




gcg gcg ctt aac atc cgt gat ctg gac gcc atc ggc ccg gcc gag gtg 2994
Ala Ala Leu Asn Ile Arg Asp Leu Asp Ala Ile Gly Pro Ala Glu Val
                475                 480                 485






gag cag atc cag cgt ctg cat atc ctg ctg gtg atg ttc aat gcc atg 3042
Glu Gln Ile Gln Arg Leu His Ile Leu Leu Val Met Phe Asn Ala Met
            490                 495                 500








cag ccc ggc gtg ttc gcc ctc tcc ggc tgg gat ctg gtc ggc gcc ctg 3090
Gln Pro Gly Val Phe Ala Leu Ser Gly Trp Asp Leu Val Gly Ala Leu
        505                 510                 515
ccg ctg gcg ccc gag cag gtc gag cac ctg atg ggc gat ggc gat acc 3138
Pro Leu Ala Pro Glu Gln Val Glu His Leu Met Gly Asp Gly Asp Thr
    520                 525                 530












cgc tgg atc aat cgc ggc ggc tat gac ctc gcc gat ctg gcg ccg gag 3186
Arg Trp Ile Asn Arg Gly Gly Tyr Asp Leu Ala Asp Leu Ala Pro Glu
535                 540                 545                 550




gcg tcg gtc tcc gcc gaa ggc ctg ccc aag gcc cgc tcg ctg tac ggc 3234
Ala Ser Val Ser Ala Glu Gly Leu Pro Lys Ala Arg Ser Leu Tyr Gly
                555                 560                 565






agc ctg gcc gag cag ctg cag cgg cca ggc tcc ttc gcc tgc cag ctc 3282
Ser Leu Ala Glu Gln Leu Gln Arg Pro Gly Ser Phe Ala Cys Gln Leu
            570                 575                 580








aag cgc atc ctc agc gtg cgc cag gcc tac gac atc gct gcc agc aag 3330
Lys Arg Ile Leu Ser Val Arg Gln Ala Tyr Asp Ile Ala Ala Ser Lys
        585                 590                 595










cag atc ctg att ccg gat gtg cag gcg ccg gga ctc ctg gtg atg gtc 3378
Gln Ile Leu Ile Pro Asp Val Gln Ala Pro Gly Leu Leu Val Met Val
    600                 605                 610












cac gag ctg cct gcc ggc aag ggc gtg cag ctc acg gca ctg aac ttc 3426
His Glu Leu Pro Ala Gly Lys Gly Val Gln Leu Thr Ala Leu Asn Phe
615                 620                 625                 630






agc gcc gag ccg gtc agc gag acc atc tgc ctg ccc ggc gtg gcg ccc 3474
Ser Ala Glu Pro Val Ser Glu Thr Ile Cys Leu Pro Gly Val Ala Pro
                635                 640                 645






ggc ccg gtg gtg gac atc att cac gag agt gtg gag ggc gac ctc acc 3522
Gly Pro Val Val Asp Ile Ile His Glu Ser Val Glu Gly Asp Leu Thr
            650                 655                 660








gac aac tgc gag ctg cag atc aac ctc gac ccg tac gag ggg ctt gcc 3570
Asp Asn Cys Glu Leu Gln Ile Asn Leu Asp Pro Tyr Glu Gly Leu Ala
        665                 670                 675










ctg cgt gtg gtg agc gcc gcg ccg ccg gtg atc tgagcgccct cttcgcgcgc 3623
Leu Arg Val Val Ser Ala Ala Pro Pro Val Ile
    680                 685






















cccgggtccg ccgctatagt gcgcagcgcc tggggcgcgc attgccctcg ccgtcgagac 3683

cagcccgtgt cgttcacttc gcttttccgc cttgcgctgc tgccgctggc gctgcttgcc 3743

gcacccgtct gggcgcagac cgcctgcccg cccggccagc agccgatctg cctgagcggc 3803

agctgcctct gcgtgccggc cgccgccagc gatccacagg cggtctacga ccgcgtccag 3863

cgtatggcta cgctggccct gcagaactgg atccagcagt cgcgcgaccg cctgatggcc 3923

ggcggcgtcg agccgatacc gctgcacatc cgctcgcagc tcgagccgta tttcgatctt 3983

gccgtgctgg agagtgcgcg gtaccgcgtc ggcgacgagg tggtgctgac tgccggcaac 4043

accctgctgc gcaacccgga cgtcaatgcc gtgaccctga tcgacgtcat cgtcttccgc 4103

cacgaggagg atgcccggga caacgtcgcg ctctgggccc atgagctcaa gcacgtcgag 4163

caatatctgg actggggcgt cgccgagttc gcccggcgct atacgcagga tttccgtgcc 4223

gtggagcgcc cggcctatgc gctggagcgt gaggtggaag aggccctgcg cgagacgcag 4283

acgcggcgct gagcgagctg atcggtgctg ctgcccgcac tgggctgaag cccaccaatg 4343

acgccggcga aaacgaaaaa ccccgccgag gcggggtttc tgacgcgggt tgtgcggtca 4403

gctcagaacg ccgggaccac ggcgcccttg tacttttcct cgatgaactg gcgtacttgc 4463

tcgctgtgca gcgcggcagc cagtttctgc atggcatcgc tgtccttgtt gtccggacgg 4523

gcgaccagaa tgttcacgta tggcgagtcg ctgccctcga tcaccagggc gtcctgggtc 4583

gggttcagct tggcttccag cgcgtagttg gtgttgatca gcgccaggtc gacctgggtc 4643

agcacgcgcg gcagagtcgc ggcttccagt tcgcggatct tgatcttctt cgggttctcg 4703

gcgatgtctt cggcgtggcg gtgatgccgg cgccgtcctt cagaccgatc 4753


<210> SEQ ID NO: 2
<211> LENGTH: 689
<212> TYPE: PRT
<213> ORGANISM: Pseudomonas stutzeri

<400> SEQUENCE: 2


























Met Ser Ile Pro Asp Asn Thr Tyr Ile Glu Trp Leu Val Ser Gln Ser
1               5                   10                  15






Met Leu His Ala Ala Arg Glu Arg Ser Arg His Tyr Ala Gly Gln Ala
            20